Doubly Robust Imputation of Incomplete Binary Longitudinal Data

نویسندگان

  • Shahab Jolani
  • Stef van Buuren
چکیده

Estimation in binary longitudinal data by using generalized estimating equation (GEE) becomes complicated in the presence of missing data because standard GEEs are only valid under the restrictive missing completely at random assumption. Weighted GEE has therefore been proposed to allow the validity of GEE's under the weaker missing at random assumption. Multiple imputation offers an attractive alternative, by which the incomplete data are pre-processed, and afterwards the standard GEE can be applied to the imputed data. Nevertheless, the imputation methodology requires correct specification of the imputation model. Dual imputation method provides a new way to increase the robustness of imputations with respect to model misspecification. The method involves integrating the so-called doubly robust ideas into the imputation model. Focusing on incomplete binary longitudinal data, we combine DIM and GEE (DIM-GEE) and study the relative performance of the new method in a case study of obesity among children, as well as a simulation study. Citation: Jolani S, van Buuren S (2014) Doubly Robust Imputation of Incomplete Binary Longitudinal Data. J Biomet Biostat 5: 194. doi:10.4172/21556180.1000194 J Biomet Biostat ISSN: 2155-6180 JBMBS, an open access journal Page 2 of 6 Volume 5 • Issue 3 • 1000194 This paper is organized as follows. In GEE for binary longitudinal data, an overview of GEE for analyzing longitudinal binary data is given. MI is briefly outlined in multiple imputation. The new imputation method is presented in dual imputation based GEE. In case study, DIM-GEE is used to analyze a case study of obesity among children and to compare with MI-GEE. A simulation study comparing DIM-GEE with MI-GEE was conducted and results are presented in simulation study. GEE for Binary Longitudinal Data Suppose the random variable Yij denotes a sequence of binary measurements at time j, j = 1,...,N for subject i, i = 1,...,n. The observed value yij is a realization of the binary response variable Yij, and we assume independence across subjects. The focus of this study is on the marginal models that describe the binary outcome vector, given a set of predictor variables. The association structure (correlation among the components) is captured by an assumed model. Let πij denote the marginal probability of observing a ‘success' for subject i at time j, i.e., πij = E(Yij) = P(Yij = 1), (1 ) ij ij ij ij ij Y π ε π π − = − be standardized deviation between the data and the model predictor for subject i at time j, and 1 2 1 2 1 2 3 1 2 3 ( ), ( ),... ij j ij ij ij j j ij ij ij E E ρ ε ε ρ ε ε ε = = be associations among responses. Following Bahadur [15], the model can be represented by 1 1 ( ) ( ) (1 ) , π π − = = ∏ − ij ij y y N i i j ij ij f c y y (1) where yi = (yi1,...,yiN) is a vector of measurements for subject i, and 1 2 1 2 3 1 2 1 2 3 1 2 1 2 3 1... 1 ( ) 1 ... ... , ij ij ij ij ij i ij j e e ij j j e e e i N i iN j j j j j c e e ρ ρ ρ < < < = + + + + ∑ ∑ y

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dual imputation model for incomplete longitudinal data.

Missing values are a practical issue in the analysis of longitudinal data. Multiple imputation (MI) is a well-known likelihood-based method that has optimal properties in terms of efficiency and consistency if the imputation model is correctly specified. Doubly robust (DR) weighing-based methods protect against misspecification bias if one of the models, but not necessarily both, for the data o...

متن کامل

Doubly robust estimates for binary longitudinal data analysis with missing response and missing covariates.

Longitudinal studies often feature incomplete response and covariate data. Likelihood-based methods such as the expectation-maximization algorithm give consistent estimators for model parameters when data are missing at random (MAR) provided that the response model and the missing covariate model are correctly specified; however, we do not need to specify the missing data mechanism. An alternat...

متن کامل

Gaussianization-based quasi-imputation and expansion strategies for incomplete correlated binary responses.

New quasi-imputation and expansion strategies for correlated binary responses are proposed by borrowing ideas from random number generation. The core idea is to convert correlated binary outcomes to multivariate normal outcomes in a sensible way so that re-conversion to the binary scale, after performing multiple imputation, yields the original specified marginal expectations and correlations. ...

متن کامل

International Biometric Society A comparison of methods for analysing incomplete longitudinal binary data

One popular method for analysing correlated binary data is Generalised Estimating Equations (GEE). It is well-known that the validity of this method in its simplest form when the data are incomplete relies on the often implausible assumption of Missing Completely at Random (MCAR). However, there are conditions under which the MCAR assumption can be relaxed to Missing at Random (MAR). Variants o...

متن کامل

چند رویکرد برخورد با مقادیر گمشده‌ متغیرهای کمی و بررسی اثر آنها بر نتایج حاصل از یک کارآزمایی‌ بالینی

Background and Objectives: A major challenge that affects the longitudinal studies is the problem of missing data. Missing in the data may result in the loss of part of the information which reduces the accuracy of the estimator and obtain the results will be biased and inaccurate. Therefore, it is necessary to evaluate the missing data mechanism from a longitudinal research and to consider thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014